40 research outputs found

    An ICT infrastructure to integrate clinical and molecular data in oncology research

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The ONCO-i2b2 platform is a bioinformatics tool designed to integrate clinical and research data and support translational research in oncology. It is implemented by the University of Pavia and the IRCCS Fondazione Maugeri hospital (FSM), and grounded on the software developed by the Informatics for Integrating Biology and the Bedside (i2b2) research center. I2b2 has delivered an open source suite based on a data warehouse, which is efficiently interrogated to find sets of interesting patients through a query tool interface.</p> <p>Methods</p> <p>Onco-i2b2 integrates data coming from multiple sources and allows the users to jointly query them. I2b2 data are then stored in a data warehouse, where facts are hierarchically structured as ontologies. Onco-i2b2 gathers data from the FSM pathology unit (PU) database and from the hospital biobank and merges them with the clinical information from the hospital information system.</p> <p>Our main effort was to provide a robust integrated research environment, giving a particular emphasis to the integration process and facing different challenges, consecutively listed: biospecimen samples privacy and anonymization; synchronization of the biobank database with the i2b2 data warehouse through a series of Extract, Transform, Load (ETL) operations; development and integration of a Natural Language Processing (NLP) module, to retrieve coded information, such as SNOMED terms and malignant tumors (TNM) classifications, and clinical tests results from unstructured medical records. Furthermore, we have developed an internal SNOMED ontology rested on the NCBO BioPortal web services.</p> <p>Results</p> <p>Onco-i2b2 manages data of more than 6,500 patients with breast cancer diagnosis collected between 2001 and 2011 (over 390 of them have at least one biological sample in the cancer biobank), more than 47,000 visits and 96,000 observations over 960 medical concepts.</p> <p>Conclusions</p> <p>Onco-i2b2 is a concrete example of how integrated Information and Communication Technology architecture can be implemented to support translational research. The next steps of our project will involve the extension of its capabilities by implementing new plug-in devoted to bioinformatics data analysis as well as a temporal query module.</p

    Big Data as a Driver for Clinical Decision Support Systems: A Learning Health Systems Perspective

    Get PDF
    Big data technologies are nowadays providing health care with powerful instruments to gather and analyze large volumes of heterogeneous data collected for different purposes, including clinical care, administration, and research. This makes possible to design IT infrastructures that favor the implementation of the so-called "Learning Healthcare System Cycle," where healthcare practice and research are part of a unique and synergic process. In this paper we highlight how "Big Data enabled" integrated data collections may support clinical decision-making together with biomedical research. Two effective implementations are reported, concerning decision support in Diabetes and in Inherited Arrhythmogenic Diseases

    Careflow Mining Techniques to Explore Type 2 Diabetes Evolution

    No full text
    In this work we describe the application of a careflow mining algorithm to detect the most frequent patterns of care in a type 2 diabetes patients cohort. The applied method enriches the detected patterns with clinical data to define temporal phenotypes across the studied population. Novel phenotypes are discovered from heterogeneous data of 424 Italian patients, and compared in terms of metabolic control and complications. Results show that careflow mining can help to summarize the complex evolution of the disease into meaningful patterns, which are also significant from a clinical point of view

    Prognostic Potential of Immune Inflammatory Biomarkers in Breast Cancer Patients Treated with Neoadjuvant Chemotherapy

    No full text
    Immune inflammatory biomarkers are easily obtained and inexpensive blood-based parameters that recently showed prognostic and predictive value in many solid tumors. In this study, we aimed to investigate the role of these biomarkers in predicting distant relapse in breast cancer patients treated with neoadjuvant chemotherapy (NACT). All breast cancer patients who referred to our Breast Unit and underwent NACT were retrospectively reviewed. The pre-treatment neutrophil-to-lymphocyte ratio (NLR), platelet-to-lymphocyte ratio (PLR), monocyte-to-lymphocyte ratio (MLR), and pan-immune-inflammation value (PIV) were calculated from complete blood counts. The primary outcome was 5-year distant-metastasis-free survival (DMFS). In receiver operating characteristic analyses, the optimal cutoff values for the NLR, PLR, MLR, and PIV were determined at 2.25, 152.46, 0.25, and 438.68, respectively. High levels of the MLR, but not the NLR, PLR, or PIV, were associated with improved 5-year DMSF in the study population using both univariate (HR 0.52, p = 0.03) and multivariate analyses (HR, 0.44; p = 0.02). Our study showed that the MLR was a significant independent parameter affecting DMFS in breast cancer patients undergoing NACT. Prospective studies are required to confirm this finding and to define reliable cutoff values, thus leading the way for the clinical application of this biomarker

    Patient-Generated Health Data Integration and Advanced Analytics for Diabetes Management: The AID-GM Platform

    No full text
    Diabetes is a high-prevalence disease that leads to an alteration in the patient&rsquo;s blood glucose (BG) values. Several factors influence the subject&rsquo;s BG profile over the day, including meals, physical activity, and sleep. Wearable devices are available for monitoring the patient&rsquo;s BG value around the clock, while activity trackers can be used to record his/her sleep and physical activity. However, few tools are available to jointly analyze the collected data, and only a minority of them provide functionalities for performing advanced and personalized analyses. In this paper, we present AID-GM, a web application that enables the patient to share with his/her diabetologist both the raw BG data collected by a flash glucose monitoring device, and the information collected by activity trackers, including physical activity, heart rate, and sleep. AID-GM provides several data views for summarizing the subject&rsquo;s metabolic control over time, and for complementing the BG profile with the information given by the activity tracker. AID-GM also allows the identification of complex temporal patterns in the collected heterogeneous data. In this paper, we also present the results of a real-world pilot study aimed to assess the usability of the proposed system. The study involved 30 pediatric patients receiving care at the Fondazione IRCCS Policlinico San Matteo Hospital in Pavia, Italy

    Clinical timelines development from textual medical reports in Italian

    No full text
    Patients diagnosed with chronic conditions are visited multiple times over the years. The medical reports produced during these visits often include valuable knowledge in the form of free text. To help physician access and review this knowledge, natural language processing and aggregation techniques are needed. In this work, we propose a system that extracts and summarizes information from medical reports written in Italian. For each patient, the system builds and visualizes a timeline of the extracted events. The proposed approach has the potential to enhance the process of reviewing patient clinical histories, reducing the time needed to access large amounts of data. In the future, an extension of the visualized timeline and an extrinsic evaluation will be performed. © 2017 IEEE
    corecore